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■ Abstract 
OO , 

' In this paper we use the relation of the index of an infinite aperiodic word and its 

recurrence function to give another characterization of Sturmian words. As a byproduct, 
we give a new proof of theorem describing the index of a Sturmian word in terms of the 
continued fraction expansion of its slope. This theorem was independently proved in [7] 
^ ■ and [g. 

in 

1 Introduction 

o" 

\ Sturmian words constitute the most studied example of aperiodic infinite words. For the first 

time they appeared in the paper of Morse and Hedlund in 1938 17J. But even after 70 years 
of extensive research, Sturmian words continue to attract attention of numerous mathemati- 
cians and newly also computer scientists. The appeal of Sturmian words stems in that they 
appear in various contexts. This is also why Sturmian words are often hidden under different 
titles: cutting sequences, Beatty sequences, mechanical words, etc. The beauty of Sturmian 
words consists in the abundance of equivalent definitions. Already Morse and Hedlund in [18j 
' show that Sturmian words can be characterized by the so-called balance property. The refer- 

■ ence [14] contains a nice exposition on diverse definitions of Sturmian words. The most recent 
ones, which does not mention, are characterization of Sturmian words using return words 
given by Vuillon [2lj (for less technical proof see ^), characterization using the number of 

■ palindromes of given length given in [10] and yet another characterization by Richomme [19] . 
OO . The aim of this paper is to give another equivalent definition of Sturmian words. Our 

characteristics puts into relation the recurrence function and the index of an infinite word u. 
Flagrant similarity between formulas for recurrence quotient and index of a Sturmian word 
^ . was noted already in [HOI?]. 

^ \ The recurrence function K associates to every n E N the minimal length R{n) G N such 

that arbitrary segment of the infinite word u of length R{n) contains all factors of u of length 
n. This function has been studied already by Hendlund and Morse, who gave an explicit 
formula for i?(n) for an arbitrary Sturmian word u and determined the so-called recurrence 
quotient, lim sup„^oo i2(n)/n. On the other hand, the index of an infinite word u describes 
the maximal repetition of a factor of u. The study of the index of infinite words is considerably 
younger, nevertheless, in the last decade very intense, especially due to applications in spectral 
theory for corresponding Schrodinger operators [8]. 

Repetitions in the most prominent Sturmian word, namely the Fibonacci word, were 
studied in [13]. More general results about index of Sturmian words can be found in [3l EJ El 
[T2l [T5t [T6l [20] . The complete solution to the problem was given independently by Carpi and 
de Luca in [7] and by Damanik and Lenz in [Sj. 
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The paper is organized as follows. In Section [2] we introduce all necessary notions. Sec- 
tion [3] contains the proof of the main result of the paper, namely the following theorem. 

Theorem 1.1. A uniformly recurrent infinite word u is Sturmian if and only if there exist 
infinitely many factors w of u such that 

R{\w\) = l^l ind(t(;) + 1 . 

Notation \w\ stands for the length of the factor to, and ind(w) is the maximal rational 
exponent r such that w"^ is a factor of u. 

It was pointed to us that already from [7j one can extract that Sturmian words satisfy 
the above equality for infinitely many factors. Their proof uses the explicit formula for 
recurrence function from [18]. Our proof relies on Vuillon's description of Sturmian words by 
return words and avoids manipulation with continued fraction of the slope of the Sturmian 
word. Our theorem moreover states that Sturmian words are the only having the above 
property. 

With the help of Theorem 11.11 one can derive the upper bound on the index of u (Sec- 
tion Hj). In Section [5] we prove that the bound is in fact reached. For the construction of 
factors of u with large repetition we use the knowledge of Sturmian morphisms, i.e. mor- 
phisms preserving the family of Sturmian words, as described in [3j. Sections H] and thus 
represent an alternative proof of the result of [7] and [9] . 



2 Preliminaries 

An alphabet ^ is a finite set of symbols, called A word w of length \w\ = n is a concatenation 
of n letters. The number of letters X occurring in the word w is denoted by \w\x- -A* is the 
set of all finite words over the alphabet A including the empty word e. Equipped with the 
operation of concatenation, it is a monoid. We define also infinite words u = {un)n&i £ 

A finite word v € A* is called a factor of a word w (finite or infinite) , if there exist words 
y;(i)^y;(2) ^y^Q^i ^j^at w = w^^'^ vw^'^\ If tf^^-* = €, thcu V Is Said to be a prefix of w, if w^^^ = e, 
then f is a suffix of w. The set of all factors of length n of an infinite word u is denoted by 
L„(n), the set of all factors of u is called the language of u and denoted by L(n). 

The mapping C : n i— > #L„(n) is called the complexity of the infinite word u. For 
determining the complexity of an infinite word one uses the so-called special factors. A factor 
w E L(ii) is called left special, if there exist letters A,B A, A ^ B, such that both Aw 
and Bw belong to h{u). Similarly, one defines right special factors. A factor of u is called 
bispecial, if it is in the same time right special and left special. Every eventually periodic 
word has bounded complexity. For aperiodic words, one has for all n € N that C(n) > n + 1. 
Infinite words, for which equality holds for all n S N, i.e. aperiodic words with minimal 
complexity, are called Sturmian words. Directly from the definition one can derive that in 
the language of a Sturmian word u one has exactly one left special and exactly one right 
special factor of each length, and Sturmian words are characterized by this property. 

Sturmian words are obviously defined over a binary alphabet, say {A,B}. The densities 
of letters ^, i? in a Sturmian word u = (Mi)jgN are well defined, 

q[A) = lim = a , Q{B) = hm = 1 — a , 

for some a G (0, 1). In fact, the language of a Sturmian word u depends only on the parameter 
a, which is also called the slope of u. For a given a, one can construct all Sturmian words 
with the slope a for example as codings of different orbits under an exchange of two intervals. 
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Let a G (0, 1) be an irrational number. Denote / = [0, 1) (resp. / = (0, 1]) and 1a = [0, a), 
Ib = [") 1) (resp. I A = (0, a], Ib = !])• The mapping T : I ^ I given by the prescription 



T{x) 



X + 1 — a for x G , 
X — a for X £ Ib , 



is called an exchange of two intervals with slope a. For an arbitrary xq G / we define an 
infinite word {un)nm by 

Un = Xe{A,B} if r"(xo)G/x. (1) 

It is known that the set of Sturmian words coincides with the set of infinite words given by 
the prescription ([T]). Since we assume that the slope is irrational, the language of a Sturmian 
word does not depend on the choice of the initial point xq, but only on a. Due to the 
symmetry a <-> 1 — a, studying the language of a Sturmian word, one can consider without 
loss of generality only parameters a > ^ . From the exchange of intervals is not difficult to see 
that with such an assumption, g{A) > g{B) and, in fact, the Sturmian word can be viewed 
as composed by blocks of the form A^, A*^"*"^, with k = Ltt^J; separated by single letters B. 

In this paper we study repetition of factors in Sturmian words. We say that a word v is 
a power of a word w, if |f | > \w\ and f is a prefix of the periodic word www ■ ■ ■ . We write 
V = w"^ where r = |f |/|u)|. The index of a word w in an infinite word u is defined by 

ind(tt;) = sup{r G Q | w*" G Ij{u)} . (2) 

A power v oiw with maximal r is called a maximal repetition of w. We have thus v = w™'^^^\ 
From what it was said above, it is clear that in a Sturmian word with slope a > ^, one has 

(X 

ind(S) = 1 and indM) = + 1 . (3) 

LI — aJ 

Taking supremum of indices over all factors of an infinite word li, one obtains an important 
characteristics of u, the so-called index of u. Formally, 

ind(u) = sup{ind(w) | w G L(u)} • (4) 

It turns out that for the study of index of Sturmian words, the notion of return words 
and recurrence function is important. A return word of a factor w of an infinite word u is 
a factor v G such that vw G w \s a. prefix of vw and the factor w occurs in vw 

exactly twice. The factor vw is often called a complete return word of w. The set of return 
words of a factor w is denoted by Ret(t(7). If the set Ret(tt)) is finite for any factor w of an 
infinite word li, then u is said to be uniformly recurrent. In fact, it means that distances 
between consecutive occurrences of a given factor are bounded. Let us mention that for a 
uniformly recurrent word u the supremum in ([2]) is always reached, as will be explained later, 
and therefore the notion of index of u in Q has sense. For a uniformly recurrent infinite 
word u we define a mapping ii : N i— > N by the prescription 

:= — 1 + max{ | v G Ret(it;), G L„(n)} , (5) 

i.e. R{n) + 1 is equal to the maximum of lengths of a complete return word over all factors of 
length n. It is not difficult to see that an arbitrary segment of the infinite word u of length 
R{n) contains all factors of the word u of length n. Formally, we have 

Ln('u) = {uiUiJ^i ■ ■ ■ Uj+„_i \ k < i < k + R{n) — n + 1} , for all /c G N . (6) 

Moreover, the number R{n) is the smallest possible, so that ([6]) remains valid. The mapping 
R{n) is called the recurrence function of the infinite word u. 
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3 Recurrence function and index 



Our aim is to find relation between the recurrence function (well defined for uniformly re- 
current words) and the index of aperiodic words. We first show that index of every factor in 
an aperiodic uniformly recurrent word is finite, and we then determine a lower bound on the 
recurrence function. 

Proposition 3.1. Let u he an aperiodic uniformly recurrent word. Then for every factor 
w £ L{u) we have ind(tt;) < +00 and 

R{\w\) > \w\\nd{w) + C{\w\) - \w\ . (7) 

Proof. Let w = wi ■ ■ ■ Wn^e & factor of u. We first show that ind(ti;) is finite. Without loss 
of generality, let ind(w) > 2. Obviously, all factors of the form Wi - ■ ■ WnWi ■ ■ ■ Wi-i for any 
1 < i < n belong to h{u). (Such factors are called conjugates of w.) Since C{n) > n + 1, 
there exists a factor w' which is not conjugate of w. If L(n) contained factors for all 
k € N, then distances between consecutive occurrences of w' would be unbounded, which 
would contradict uniform recurrence of u. Therefore md{w) < +00. 

Let now ?; be a maximal repetition of w. We prolong f to a factor vv' € L(n) so that 
vv' contains all C(|w|) factors of u of length \w\, but none of prefixes of vv' satisfies it. 
Since v has at most \w\ factors of length \w\, (namely the conjugates of w), we must have 
\v'\ > Cdt^l) — From the definition of the recurrence function, we have 

i?(|?i;|) > \vv'\ > \v\ + C{\w\) — \ui\ . 

As V = \w\ md{w), the proof is complete. □ 

Note that in particular, for a Sturmian word u one has -R(|tt;|) > \w\ md{w) + 1 for every 
factor w of u. The following proposition states, that if equality is reached for infinitely many 
factors w of an aperiodic word u, then u is Sturmian. 

Proposition 3.2. Let u be an aperiodic uniformly recurrent infinite word. If there exist 
infinitely many factors w S L{u) such that R{\w\) = \w\md{w) + 1, then u is a Sturmian 
word. 

Proof. Using the assumption of the proposition and ([7]), there exist infinitely many factors 
w of u such that C(|t(;|) < + 1, i.e. for infinitely many n G N we have C{n) < n + 1. The 
complexity of an aperiodic word is a strictly increasing function and C(l) > 2. This implies 
that C{n) = n-\- 1 for all n and u is therefore Sturmian. □ 

In order to show the opposite implication to that of Proposition 13.21 we need to cite a 
nice result of Vuillon [21] which characterizes Sturmian words using return words. He shows 
that a binary infinite word u is Sturmian if and only if every factor of u has exactly two 
return words. For every factor w of a Sturmian word u thus exist two finite words ro(if), 
ri(w) such that the suffix of u starting with the first occurrence of w can be written as an 
infinite concatenation of blocks rQ{w) and ri{w), i.e. 

u = p ri^^ {w)ri^ {wyi^ iw)ri^ (w) ■ ■ ■ , 

where p is a prefix of u and io,ii,i2,i3r " ^ {0, 1}- We can therefore define the so-called 
derivated word v = (f„)neN over the alphabet {0, 1} by the prescription Vn = in, coding the 
order of the blocks rQ(w), ri{w) in the infinite concatenation. We could now study return 
words of factors of the newly defined infinite word v. However, since return words of factors 
of the derivated word are in one-to-one correspondence with return words of factors in the 
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original infinite word (see [TT] ) , we deduce that every factor of v has again exactly two return 
words, and thus is itself Sturmian. 

It is obvious that for finding factors w with the maximal index in the infinite word, we 
can limit our consideration to primitive factors w, i.e. such that w ^ for any z ^L(u) and 
any /c G N, /c > 2. 

Proposition 3.3. Let u be a Sturmian word and let w G L{u) be a primitive factor such that 
WW G L{u), and, moreover, let it have the maximal index among all factors of u of length n 
with the above properties. Then 

R{n) = nmd{w) + 1 . 

Proof. Let k = [md(w)] and 9 = {md{w)}. Then w can be written as w = wiW2 where 
= On and the maximal repetition of w is the word 

{wiW2){wiW2) ■ ■ ■ {W1W2) Wl G L(u) . 

^ V ' 

k times 

Let us find X,Y € {A, B} such that 

XwiW2 • ■ ■ W1W2W1Y G L(u) . (8) 

Since md{w) = k + 9 is the greatest power such that w^~^^ G L(m), the letter Y is not a prefix 
of W2. Since w is a primitive word with the greatest index in L„(tt), the letter X is not a 
suffix of W2. This, together with the fact that k > 2, means that W1W2 = U7 is a left special 
factor and W2W1 =: w' is a right special factor. A Sturmian word has exactly one left special 
and one right special factor of each length. 

Let us consider the Rauzy graph r„ of u. The set of vertices of r„ is equal to L„('u) 
and the set of its edges to L„+i(u). The Rauzy graph r„ of a Sturmian word thus has n + 1 
vertices and n + 2 edges. An edge e G L„+i(m) starts in a vertex v G L„(u) and ends in 
v' G L„(u) if u is a prefix and v' a suffix of e. An arbitrary factor u of length m > n in the 
language of the infinite word u can be viewed as a path of length m — n in the graph r„ 
starting in the vertex corresponding to the prefix and ending in the vertex corresponding to 
the suffix of u of length n. 

Since w G L„(n), ww G L(n) and w is primitive, there exists a cycle C of length n in the 
graph r„ containing the factor w. Let us denote the vertices of the cycle C by = w, v^^\ 
. . . , t>("~^). Since r„ has n + 1 vertices, only one of them is missing in C. Let us denote it 
by Recah that w is the only left special factor in L„(m), and thus the only vertex in 

r„ with indegree 2. Similarly, w' is the only right special factor in L„(ti) and thus the only 
vertex in r„ with outdegree 2. Since is a strongly connected graph, an edge must go from 
the vertex v^"^^ to the cycle C and an edge from the cycle C to the vertex v^"^ . Thus w' = v^^^ 
for some < s < n — 1. Relation ([8|) implies that the edge from t;*^*-* to f^"^ is W2W1Y and 
the edge from to 

^;(0) is 

XwiW2. The Rauzy graph r„ is thus of the following form. 
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Let us consider the return words of w. Since ww € one of the return words of w is 

ro(w) = w, the complete return word is ww and the corresponding path in the Rauzy graph 
is the cycle C. We denote the other return word of w by ri{w). From the structure of the 
graph r„ it follows that the complete return word ri{w)w corresponds to the cycle C given 
by vertices v^^\ v^^\ . . . , v^^\ v^^\ 

As we have already mentioned, the order of the blocks tq^w), ri{w) is given by the 
derivated word over the alphabet {0, 1}, which is Sturmian. Since {rQ{w))^ = w^ (z for 
k = [ind(u;)J > 2, the derivated word has blocks 0^, O'^""'^ separated by single letters 1. As a 
consequence, among all factors of length n, it is u^") which has the longest complete return 
word, namely of the form 

X WW ■ ■ ■ w^ wiY . 

k times 

From the definition ([5]) it follows that 

R{n) = -1 + \w''+^\ +2 = l + {k + e)n, 
which completes the proof. □ 

Proof of Theorem In order to comlpete the proof of Theorem 11.11 we have to show that 
there exist infinitely many primitive factors w with index at least 2. For the construction of 
such factors we make use of bispecial factors. Let 6 be a bispecial factor in L{u). Denote by 
n its length, n := \b\ and by ro(6), ri(6) its return words. From the Rauzy graph r„ it follows 
that the two return words of b are given by the two cycles in r„, which have b as the only 
common vertex. Therefore |ro(6)| + |?^i(&)| = n + 2. Without loss of generality, let b contain 
both letters. Then > 2. At least for one of the return words, say ro(6), it holds that 

n/2 < |ro(6)| < n, and therefore ro(6) is a prefix of b. It follows that the complete return 
word ro(6)6 € L(u) has as its prefix ro(&)ro(6). Moreover, a return word of an arbitrary factor 
of any uniformly recurrent word is primitive. Thus we can take ro(&) for the desired factor w. 
Since there are infinitely many bispecial factors b, we can construct infinitely many primitive 
factors with index > 2 and length > □ 



4 Upper bound on index of Sturmian words 

In this section we mention the consequences of Proposition 13.31 which puts into relation the 
recurrence function and index of factors of a Sturmian word. In particular, we can very easily 
derive the upper bound on the index of a Sturmian word, which constitutes an alternative 
proof for the result of Damanik and Lenz [9]. The bound depends on the continued fraction 
expansion of the slope of the Sturmian word. 

Recall the notion of continued fraction. To every irrational /3 G (0, 1) one associates the 
continued fraction /3 = [0, 6i, 62, . . . ], where bi G Z,, bi > 1. Obviously, if (3 > ^, then bi = 1. 
The convergents of /3 form a sequence of fractions (|^), 

pi 1 P2 1 P3 1 



91 ^1 92 , 1 93 , 1 

bi + — bi + 

02 1 

^'3 



Pn 

It is known that the denominators qn of convergents of /3 satisfy the recurrence 



We have p„ coprime to and lim„^oo ^ = (i- 
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with initial values q-i = 0, qq = 1. Denoting the matrix Mc := (f o)' then the recurrence 
can be rewritten as 

{qN,qN-i) = {qN-i,qN-2)Mb^ , 

and by repetition, we obtain 

{qN,qN-i) = {l,0)Mb,Mb,---Mb^ 

In order to extract the component qj\f, it suffices to multiply the latter from the right by the 
vector (q). We obtain 

qN = (1, 0)Mb,Mfe, ■■■Mb^ Q = (1, 0)Mb^ ■ ■ ■ Mb.Mb, Q , (9) 

where we have used that equality must hold also for the transpose qjf = q^ and MJ = Mc 
for all c G N. 

For the derivation of the lower bound on the index of Sturmian words we use an old result 
on recurrence function of Sturmian words given in 



Theorem 4.1 ([E]). Let u be a Sturmian word with slope a. Denote by qQ, qi, q2, ■ ■ ■ the 

denominators of the convergents of a. Then for every n G N, 

R{n) = qN+i + qN + n- — 1 , where N is such that qN l^n < qN+i ■ 

Substituting into Proposition l3.3t one obtains an easy proof of the following result. Similar 
derivation one can find in [7]. 

Corollary 4.2. Index of every factor of a Sturmian word u with the slope a = [0, 1, 02, as, • • • ] 

is bounded by 

sup s 2 + ON+i H N > 1 

^ qN 

where qN are the denominators of the convergents of a 



Proof. Obviously, it suffices to consider only factors w satisfying assumptions of Theorem l3.31 
Let |tt;| = n and let qN l^n < qN+i- Using Proposition 13.31 and Theorem 14. H we have 

nind(tf) + l = R{n) = qN+i + qN + n — 1 . 

Therefore 

qN{ind{w) - l) < n(ind(u;) - l) = qN+i + qN - "2 = (aN+i + l)qN + qN-i - , 
and consequently 

ma{w) < 2 + on+i + 



qN 



□ 



5 Sturmian morphisms and factors with maximal index 

In this section we provide a lower bound on the index of a Sturmian word u of slope a. 
Obviously, ind('u) > 02 + 1, since Lrr^J ™ the formula ([3]) for the index of the letter A is 
equal to the coefficient 02 of the continued fraction of a. The idea for construction of factors 
with large index in a Sturmian word u stems in application of specific Sturmian morphisms. 
Since application of a morphism preserves repetitions, it suffices to know how the chosen 
morphism changes the slope of the Sturmian word. Let us recall the necessary facts. 
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A morphism over the alphabet {A, B} is a mapping (p : {A, B}* {A, B}* satisfying 
ip{wiW2) = ip{wi)ip{w2) . Obviously, a morphism is uniquely determined by (p[A)^ ^[B). The 
incidence matrix of a morphism (p is given by 



\^{A)\a W{A)\b 
\^{B)\a W{B)\b 



The action of a morphism can be naturally extended to infinite words by 

(f{uoUiU2 ■■■) = ip{uo)ip{ui)ip{u2) ■ ■ ■ 

It is easy to show that for the number of letters in the image of a word w, one has 

(|(/pHU, W{w)\b) = (kU, \w\b)m^ . (10) 

From that, we can deduce the following fact for the densities of letters in an infinite word 
u. If q{A), q{B) are the densities in u, than the densities in the word u' = (p{u) are g'{A), 
q'{B), where 

{g'{A), q'{B)) = const. {q{A), q{B))M^ , (11) 

and const, is chosen so that q'{A) + q'{B) = 1. 

A morphism ip is called Sturmian, if ip{u) is a Sturmian word for every Sturmian word u. 
Obviously, the set of Sturmian morphisms equipped with the operation of composition is a 
monoid, denoted by St. It is known [4J that the monoid St has three generators, namely 

A ^ AB A ^ BA a ^ B 

B ^ B B ^ B B ^ A ^^^^ 

Consider a Sturmian word with slope j3 G (^, 1) whose continued fraction is of the form 
f3 = [0, 1, &2i &3i • • • ]• For c G N, we shall study the action of the morphism 

A ^ A'B 

on the Sturmian word u with slope (3. The morphism 99 is a Sturmian morphism; it is a com- 
position of the generators (I12p of the Sturmian monoid, namely 99 = Eip2- The corresponding 
incidence matrix is M^p = Mc = (fo)) ^ defined in the Preliminaries. Consequently, the 
infinite word fiu) is also Sturmian, i.e. there exists an irrational such that u' := (p{u) is a 
Sturmian word with slope /?'. According to (|lip . the densities of letters a,b in the word u' 
satisfy 

iP', 1-/3') = const. (/3, 1 - /3) ( ^ J ) . 
Therefore /?' = ^^^^^^ • It is not difficult to show that the continued fraction of /?' is equal to 

/?' = [0,1,0,62,63, •••]• (14) 

The following lemma is crucial for construction of factors of a Sturmian word with maximal 
index. 

Lemma 5.1. Let u be a Sturmian word with slope (3 having the continued fraction (3 = 
[0, 1, 62, 63, • • • ]• Let w € L{u), and let r £ Q, r > 2 be such that v = £ L(u). Denote 

w' = (p{w) and v' = ip{v)A'^ , 

where (p is the morphism given by (fT3]) . Then v' is a rational power of w' in a Sturmian word 
u' with slope /?' = [0, 1, c, 62, 63, . . . ]. 
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Proof. If \w\ = 1, then necessarily w = A, v = A"^ for 2 < r < 62 + 1, f{w) = A'^B, and 
(p{v)A'^ = (A'^ByA^ is a factor of u', since a Sturmian word with slope /?' = [0, 1, c, 62, ^3, • • • ] 
has blocks A'^, A'^^^ separated by single letters B. 

If \w\ > 2, let us write w = W1W2 so that W2 ^ e and v = {wiW2)^'^^wi. Then ip{v)A'^ = 
(p{w^'^^)(p{wi)A'^. In order to show that (p{v)A'^ is a power of f{w), it suffices to show that 
93(^1)^'^ is a prefix of or ip{w)ip{w) . If i(;2 starts with A or 5^4, then 92(^2) has prefix 

and thus 93(^1)^'^ is a prefix of ^{w) = ip{wi)ip{w2). Since BB ^ it remains to 

discuss the special case when W2 = B. As \w\ > 2, we have wi ^ e. Since W2W1 G L(n), the 
word must start with the letter A and therefore ^p{vJi)A^ is a prefix of (p{wiB)(p{wiB) = 

Theorem 5.2. Lei u be a Sturmian word with slope a = [0,1,02,03,...]. Then for every 
G N there exists a factor w € L{u) with index at least equal to 2 + a^+i + "^^^^ ^ , where 
qn is the denominator of the N-th convergent of a. 

Proof. For = 1 it follows from the continued fraction of a that gi = 1, go = 1 aiid therefore 
we have to find a factor with index 2 + 02 — 1 = 02 + 1. It suffices to put w = A. Therefore 
we consider N >2. We shall construct the desired factor w and its power v by (A^ — l)-fold 
application of Lemma 15.11 Consider the irrational number oq with the continued fraction 
ao = [0) 1) fl7V+i) aAf+2....]- Take a Sturmian word u^^^ with slope ag and its factors w^^'^ := A, 
yio) ^1+ajv+i £qj, i]2itial values of the construction. For 1 < i < A^ — 1, define 

v^''> := ^i{v^'-^^)A^^-^+\ where 97^:^^ ^"^-.+15^ 

By Lemma |5.H the word w^'^^ is a factor of a Sturmian word u^*) with slope ai, where Oi has 
the continued fraction = [0, 1, aAr+i_j, a7v+2-i) • • • ] and v^^^ is a power of w^'^^ in the word 
In particular, w^^~^^ is a factor of a Sturmian word u with slope a = [0, 1, 02, 03, . . . ] 
and v^^~^^ is its power in u. 

It suffices now to show that the length of w^^~^^ is qn and the length of v^^~^^ is 
(2 + aN+i)qN + ^Af-i — 2. For the recurrent expression of lengths of factors w^'^\ v^^^ we use 
formula (1101). We have 



for alH = 1, 2, . . . , A^ — 1, with (^\w^^^ \a, \w^'^^ \b) = (1, 0). It can be easily seen that 

In order to obtain \w^^^^^ \ = \w^^~^^ \a + \w^^~^'^ \b, we multiply the latter from the right by 
the vector (j^) , which can be also written as (J) = (10) (0) ■ Since in the continued fraction 
of a we have oi = 1, we can use ([9]) to obtain 

|^(7V-l)| = {l,0)Ma^Ma^_, ■ ■ ■ Ma.MaXo) = QN . 

From the definition of words f we have for their lengths 

{\v^'^\a,\v(^^\b) = {\v^'-'^\A,\v^'-'^\B)Ma^_^^, + {aN-^+l,0) , (15) 

with (Iu^'^^Iai I'w'-'^^Ib) = (1 + aAr+i,0). Let us compute the lengths for A^ = 1, 
{\v^^^a,\v^^^b) = (l + a;v+i,0)(7 1) + (ajv,0) = (2 + aiv+i)(l, 0)M,^ + (1, 0) - (1,1). 
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Since for every c we have —(1, l)Mc + (c, 0) = — (1, 1), by repeated application of the recur- 
rence (jlSp we obtain 

(|i;(^-i)U, \b) = (2 + a^+i)(l, 0)M„^M„^_, • • • + (1, 0)M„^_, • • • - (1, 1) . 

Again, multiplying the latter from the right by the vector (|) = M^^ (q) and using Q, we 
obtain 

\v(^-^) I = (2 + aN+i)qN + QN-i - 2 . 

□ 

6 Acknowledgements 

We are grateful to J. -P. Allouche for pointing out the reference [7j. We also acknowledge 
financial support by the grants MSM6840770039 and LC06002 of the Ministry of Education, 
Youth, and Sports of the Czech Republic. 

References 

[1] B. Adamczewski and J. -P. Allouche, Reversals and palindromes in continued fractions, 
Theoret. Comput. Sci. 380 (2007), 220-237. 

[2] L. Balkova, E. Pelantova, and W. Steiner, Sequences with constant number of return 
words, Monatsh. Math. (2008). 

[3] J. Berstel, On the index of Sturmian words. In ^Jewels are forever', Springer (1999), 
287-294. 

[4] J. Berstel and P. Seebold, Morphismes de Sturm, Bull. Belg. Math. Soc. Simon Stevin 
1 (1994), 175-189. 

[5] V. Berthe, C. Holton, and L. Q. Zamboni, Initial powers of Sturmian sequences. Acta 
Arith. 122 (2006), 315-347. 

[6] W.-T. Cao and Z.-Y. Wen, Some properties of the factors of Sturmian sequences, The- 
oret. Comput. Sci. 304 (2003), 365-385. 

[7] A. Carpi and A. de Luca, Special factors, periodicity, and an application to Sturmian 
words. Acta Inform. 36 (2000), 983-1006. 

[8] D. Damanik, Singular continuous spectrum for a class of substitution Hamiltonians. II, 
Lett. Math. Phys. 54 (2000), 25-31. 

[9] D. Damanik and D. Lenz, The index of Sturmian sequences, European J. Combin. 23 
(2002), 23-29. 

[10] X. Droubay and G. Pirillo, Palindromes and sturmian words, Theor. Comput. Sci. 223 
(1999), 73-85. 

[11] F. Durand, A characterization of substitutive sequences using return words. Discrete 
Math. 179 (1998), 89-101. 

[12] J. Justin and G. Pirillo, Fractional powers in Sturmian words, Theoret. Comput. Sci. 
255 (2001), 363-376. 



10 



[13] J. Karhumaki, On cube-free a;-words generated by binary morphisms, Discrete Appl. 
Math. 5 (1983), 279-297. 

[14] M. Lothaire, Algebraic combinatorics on words, volume 90 of Encyclopedia of Mathe- 
matics and its Applications, Cambridge University Press, Cambridge, (2002). 

[15] F. Mignosi and G. Pirillo, Repetitions in the Fibonacci infinite word, RAIRO Inform. 
Theor. Appl. 26 (1992), 199-204. 

[16] F. Mignosi, Infinite words with hnear subword complexity, Theoret. Comput. Sci. 65 
(1989), 221-242. 

[17] M. Morse and G. A. Hedlund, Symbolic Dynamics, Amer. J. Math. 60 (1938), 815-866. 

[18] M. Morse and G. A. Hedlund, Symbolic dynamics II. Sturmian trajectories, Amer. J. 
Math. 62 (1940), 1-42. 

[19] G. Richomme, Another characterization of Sturmian words (one more). Bull. Eur. 
Assoc. Theor. Comput. Sci. EATCS 67 (1999), 173-175. 

[20] D. Vandeth, Sturmian words and words with a critical exponent, Theoret. Comput. Sci. 
242 (2000), 283-300. 

[21] L. Vuillon, A characterization of Sturmian words by return words, European J. Combin. 
22 (2001), 263-275. 



11 



